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SOME    AFELIGATIGNS   Of    BRUNS'S    "KOLLEKTIVMASSLfcHttfil11 . 

I.  INTRODUCTION. 

Mathematical  statistics  is  much  concerned  with  the  description  of 
statistical  data  by  means  of  functions  representing  curves  which  such 
distributions  if  plotted  tend  to  follow.  Obviously,  that  system  is  use- 
less ,  that  will  not  give  approximately  the  same  frequencies  as  the  orig- 
inal data,  even  tho  the  hypotheses  under  which  the  original  theory  is  de- 
rived seem  most  plausible.  I.  Palin  Elderton  in  1909  presented  before 
the  International  Congress  of  Mathematicians  a  paper  entitled  nA  Compari- 
son of  Some  Curves  used  for  graduating  Chance-distributions."  He  compares 
by  actual  goodness  of  fit  certain  frequency  distributions  when  treated 
by  different  systems  of  graduation  and  suggests  that  this  method  as  well 
as  the  plausibility  of  the  assumptions  in  the  derivation  of  the  theory 
should  be  a  basis  of  comparison  of  the  efficiency  of  different  types  of 
curves..  He  suggests  that  an  investigation  of  the  relative  capacities  of 
various  curves  from  the  point  of  view  of  graduation  will  be  helpful  to 
the  mathematician,  not  only  aiding  him  in  determining  the  relative  merits 
of  different  systems  but  also,  by  showing  the  failure  in  actual  practice, 
enabling  him  to  discover  the  weaknesses  in  the  initial  assumptions.  It  is 
the  purpose  of  the  present  paper  to  make  some  applications  of  Bruns's 
series  and  to  compare  the  results  with  the  results  obtained  by  an  appli- 
cation of  a  Pearson's  curve. 

The  earliest  successful  curve  was  the  normal  or  Gaussian  function 
y=e*x  ;  this  curve  is  a  description  of  the  binomial  (o+(7)n  when  p  and  q 


are  approximately  equal  or  when  n  is  very  large.  It  has  been  observed 
that  this  curve  is  applicable  to  the  representation  of  much  scientific 
data,  especially  to  problems  in  the  biological  sciences.  Many  writers 
have  endeavored  to  show  that  it  is  the  law  of  nature  and  to  verify  their 
theory  by  illustrations  showing  its  goodness  of  fit1.  Different  instances 
of  failure  have  lead  to  studies  by  Charlier2,  Pearson3,  Bruns4,  Thiele5, 
Kapteyn",  and  Edgeworth7  in  the  fitting  of  given  data  by  functions  other 
than  the  normal  curve.  Xarl  Pearson  has  derived  a  set  of  seven  equations 
from  the  differential  equation 

X»iu  a   £-i.2  

y  dx  '  K+"Jc+brJr*  ' 

The  normal  curve  is  a  special  case  of  this  equation  obtained  when  5  t=&;s=0, 

Thiele  as  early  as  1339  discussed  the  fitting  of  data  by  means  of  a 
series  and  in  1903  oublished  in  his"Theory  of  Observations"  a  discussion 
of  the  series 

F(x)  =  Any(x)  +  (x)  +  A4yiv(x)  +  

1  Karl  Pearson,  Phil.   Mag.,   ser.    5,   vol.   50,   p.  171. 

2  C.V.P.   Charlier,   Researches  into  the  Theory  of  Probability",  Meddel- 

anden  fran  Lunds  Astror.omiska  Observator ium,   Lund,  1906.. 

3  Karl  Pearson,   Skew  Variation  in  Homogeneous  Material,  Phil.  Trans.., 

A,   vol.    186,    pp.   S4S  et  seq.    On  the  Systematic  Pitting  of  Curves  to 

Obsisrvat  ions  and  Measurements.,    Biom.  ,    v.  I, pp.  265  et  seq. , v. II, pp. l 

et  seq, 

4  H  Bruns, Wahr scheinl ichkeit srechnung  und  Kollektivmasslehre.Leipzity.905, 
8  T.   N.   Thiele,   Theory  of  Observations,   London,  1905. 

6  J. C.   Kapteyn,    Skew  Frequency-Curves  in  Biology  and  Statistics,  Gronin- 

gen,  1903. 

7  P. Y. Edgeworth,   Journal  of  Statistical  Society,   vol. 61, pp. 675-689, Camb. 

Phil.    Trans.,   vol.    20,    pp.   36-65,  113-141. 
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Chafclier  used  not  only  this  equation  which  he  called  type  A  but  also 
another  which  he  called  type  B.  So  far  no  criterion  as  to  when  A  or  B 
should  be  used  has  been  developed;  Charlier  himself  seems  to  use  B  when 
A  is  unsuccessful  • 

Edgeworth  has  also  discussed  at  considerable  length  a  generalized 
form  of  this  equation. 

Bruns  has  studied  this  form;  he  is  interested  in  it  especially  as  a 
means  of  fitting  an  arbitrary  distribution.  His  very  rigorous  development 
of  the  function  is  published  in  hisnWahrscheinlichkeitsrechnung  und  Xol- 
lektivmasslehre";  he  has  also  calculated  tables  that  are  of  great  assis- 
tance in  the  use  of  the  series.  R.  v.  Mises  of  Strassburg  published  in 
1912  in  "Jahresbericht  der  Deutschen  Mathematiker  Vereinigung"  a  develop- 
ment of  the  series  very  clcsely  following  the  development  of  a  Fourier's 
series.  A  brief  statement  of  this  modification  of  the  method  of  Bruns 
follows. 

II.  BRIEF  STATEMENT  OF  MISES  MODIFICATION  OF  METHOD  OF  BRUNS. 

The  exponential  function,  cp,=  -l'e~**y  and  its  derivatives  form  the 
different  terms  of  the  series.  The  coefficients  of  these  terms  in  the 
series  are  to  be  determined.  It  will  be  convenient  to  indicate  the  suc- 
cessive derivatives  of  <pt  by  q>.¥,  q>?,  <p4,  ....  Then 

/ff$c  =  -2xe~ 

/n<Qi  =  (-2  +  4jfc2)e-«2 

✓ff(T4  =  (12*  -  8#)e-x* 
 i  iiii  ........ 

W. P. Elderton, Atti  del  IV  Congresao  I nternazionale  del  Matematici,  S,p. 

2  37 


4. 


where  •      "a  3  J*  . 

If  we  let  the  polynomials  -  factors  of  e~x    -  of  degree  y-1  be  repre 
sented  by  pv  then  the  following  recursion  formula  is  evident: 

Let  a    =  -2s:  

Then  «t=l;  «,  =  -*  ;  «,  =  -  1  ♦         «4  =  *  .  |3;  «5  *  -|  -  *g  +  |1  ; 

.    =  ^  JL  ♦  *1  -  -*i  5  a,  «  *  -1.  +  *-  -  *1  +  Jtlm 

*•         32     24      120       7         384      64     96  720 

It  is  obvious  that  the  term  of  highest  cower  in  av  is  of  degree  y-1  and 
that  it  has  a  coefficient  . 


The  value  of  the  integral,  £ x%j^udxf or  different  values  of  y  and  u 


is  next  to  be  discussed.  If  u>v  then  by  integration  by  parts  we  obtain: 

where  denotes  the  derivative  of  o^.  The  integrand  vanishes  in  the  last 
term  since  <*v  is  of  degree  y-1. 

If  v>u,  differs  from  oucpv  by  a  constant.  Therefore  in  this  case 

also  £2&v$udx    =  °>  that  is»  whenever  y  /  y,  the  integral  j^a^d*    -  0. 

But  if  y  =  y 

The  (y-l)st  derivative  of  <xv  is  a  constant. 

*kjPt<**    =  =  *5  =  1. 

The  formula  for  the  series  may  be  written 

fix)  =  ct?t  +  fif.»sfo  ♦  c„<Pr»  +  c4<r4  +  c5cp5  +  ca<?e  * 
in  which  cv  are  constants  to  be  determined.  Assuming  such  valfces  for  c 
that  the  series  is  uniformly  convergent,  we  may  multiply  by  <p    and  inte- 


grate  term  bp  term.  According  to  the  theory  just  developed  all  the  terms 

oc 

except  the  yth  will  vanish  and  we  have  cv=£ae/'«vdjc.  The  analogy  with  the 
coefficients  of  the  terms  in  a  Fourier's  series  is  apparent. 

Remembering  the  values  already  determined  for  <*v  and  using  also, 
where  Mv  represents  the  uth  moment  about  the  mean,  A'o=£*/*dx,  M t=/"  xfdx, 
£fi*£*x's/,£*»  etc.,  one  may  readily  determine  the  values  for  cv. 

Ce  =  ,1m    .  Ijf,  +  Am    ;  c    =  -  li  +  la  ^  As.  •  q=  -  JLjf  +1*  -l4*Jti, 

5      32  *      8  *      24  4       *         32       2      120  '    7        384  "  64    96  720 

By  means  of  the  Sturm-Licuville  theory6,  Mises  then  shows  the  con- 
vergence of  the  q>  series  and  states  the  following  theorem: 

If  fix)  is  a  continuous  function  of  limited  oscillation  which  be- 
comes zero  of  at  least  the  third  order  at  infinity,  then  the  series 

f(x)    =   CtVi   +   Cptp.p   +    C^p  +  

with  the  coeff  icients  c^l^f^dx  is  uniformly  convergent  in  the  entire 
interval  and  represents  the  function  in  that  interval. 

He  states  further: 

If  a  real  function  with  all  its  derivatives  vanishes  at  infinity 
and  with  these  derivatives  builds  a  succession  of  normal  functions  of  a 
differential  equation  of  the  second  order,  then  this  function  n.ust  be  of 
the  form:  u  =  Ke"         where  K  and  h  are  arbitrary  real  constants.. 

If  a  frequency  distribution  to  be  approximated  by  a  Bruns  series  be 
given,  there  are  two  arbitrary  values  to  be  determined:  1)  the  origin  or 
point  from  which  measurements  are  made,  and  2)  the  range  of  the  sub-class 
or  the  unit  of  grouping.  If  a  be  the  arithmetic  mean,  then  for  z-x-a  the 
S2§££i2i§Qii_.2£-. wi*l  vanish,  that  is,  fzfdx=Q.  Bruns  so  determines 
*  Journal  des  Mathemat iaues ,   tomes  I  and  II. 
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ht  the  unit  of  grouping  that  the  coefficient  of  cp,  vanishes;  if  u-hzyther\ 

3  -  \ffdz  *  fy*ffz'*dz 

Here  c«<  vanishes  when    ft'*  =  =  -i-  where  o  is  the  standard  devia- 

2  fz£dz 

tion. 

With  these  values  for  h  and  z  the  coefficients  of  the  series 
fix)  -  ct<p,  +  c.^p  +  c.,<?s  + 

then  become: 

et  sh  ;  c,  =  0  ;  c.,  =  0  ;  c«  =  -  ^3i/fl  ;  c5  =  -  |-  +  ib*4**  » 

r     =  -  &f.4Ct  •  c     =  Am  -  +  4 la 5.  . 

c«        24'        120       7      192       96  720 

Bruns  writes  the  series: 

2s(x)  -  1  «  Drf(v)  +  ftitteli  *  tdSkli  +  2**Ula  ♦  

<6  <i  ^ 

*  2         2  2      22         2  2s 


where  0**1,0.,  =  -  SS*!*,  etc. 

o 

If  we  compare  this  series  with 

fix)  3  CtTt  +  c4<p4  +  c5cps  +  c«q?e  +  ••••• 

we  find 


fix) 

=  s(*) 

q?(y) 

-  If**-*'*** 

=  -l(e-t'£)m  «p4  ■  -lie'**) 

2      2"*  4H4 


ttr 
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Since  Bruns  has  calculated  tables  of  values  of  <p(i>),  ?(u)t,  ^||*»« 

2  2^ 

iL^'ll  the  numerical  work  is  simplified  by  U3ing  his  form  of  the  equation. 
2 

Having,  thus,  a  method  for  determining  the  coefficients  of  the  terms  of 

the  series  we  naturally  inquire  whether  or  not  the  approximations  of  f{x) 

secured  by  the  use  of  these  coefficients  will  satisfy  a  criterion  of  fit 

such  as  the  "least  squares"  condition;  this  condition  is  that 

flf(x)  -  1I1  ciyi{x)l*dx  ■ 

shall  be  a  minimum.  This  quantity  as  written  is  not  a  minimum;  however, 

■s 

if  it  be  weighted  with  e      which  really  gives  more  weight  to  the  devia- 
tions far  from  0,  then 

will  be  a  minimum  for  the  values  of  c  which  have  been  determined;  for, 
the  value  of  cit  for  which  this  function  is  a  minimum,  is  found  by  settins 
the  first  derivative  of  the  function  with  respect  to  cv  equal  to  0  and 
solving  for  cv. 

=  flf  (x)  -    S    c  q>  ]cp  e**dx  =  0 

^iJ^f^x)  -  i|1  c^Je^x  =  0,  where  k  is  a  constant, 
Vi*  -  c]  =  0 

or  q    =/*  fix) agxwhich  is  the  formula  by  which  the  coefficients  are  de- 
termined. 

III.  DISCUSSION  0?  NUMERICAL  DISTRIBUTIONS. 

The  frequency  distribution  for  which  a  theoretical  curve  is  obtained 
by  means  of  a  Bruns  series  is  taken  from  the  Medico-Actuarial  Mortality 
Investigation,  volume  I:,  pages  42  and  43.  The  table  gives  a  distribution 
of  heights  of  men  between  the  ages  of  25  and  30  and  weighing  138  pounds 
or  above..  The  table  below  gives  the  observed  frequencies  and  the  theore- 


■ 
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tical  frequencies  when  Pearson's  Type  IV  curve  is  used;  also  the  frequen- 
cies obtained  by  the  use  of  one,  three  or  five  terms  of  the  Bruns  series, 
ffhen  one  term  only  is  used,  we  obtaiii,  of  course,  the  distribution  resul- 
ting under  the  application  of  the  normal  curve.  In  the  table,  x  designates 
the  mid-points  of  classes  or  the  class  marks xt\  f(x)t  ,  the  observed  dis- 
tribution; fix)z  ,  the  theoretical  distribution  using  one  term  of  the 
series;  the  theoretical  using  three  terms;  f(x)A,  the  theoretical 

using  five  t6rms;  f(x)x9  the  theoretical  obtained  by  Pearson's  Type  IV 
Curve. 


X  - 

A,  . 

f(x ) 

fix)*. 

fix)* . 

f  v  X  l  £  • 

60 

1 

0 

0 

0 

0 

61 

0 

0 

0 

0 

0 

62 

s 

0 

1 

2 

1 

6S 

1 

1 

3 

5 

2 

64 

2 

2 

7 

9 

5 

65 

10 

10 

18 

11 

14 

66 

37 

33 

33 

20 

39 

67 

91 

87 

75 

65 

77 

66 

173 

183 

198 

176 

160 

62 

261 

237 

214 

277 

287 

70 

403 

442 

443 

436 

424 

71 

511 

455 

493 

450 

439 

72 

457 

397 

429 

391 

427 

73 

228 

273 

281 

235 

281 

74 

152 

157 

141 

166 

143 

75  .  . . 

70 

58 

70 

59 

When  f(x)9,  f(x)^f  ...  refer  to  "sum-  functions",  they  include  the 
entire  class   for  which  X  is  a  class  mark. 

The  zeros  used  here  mean  that  the  theoretical   values  are  less  than 

0.5. 
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X, 

fix)  t. 

78 

31 

26 

22 

20 

21 

77 

6 

8 

8 

4 

7 

78 

2 

2 

3 

1 

2. 

In  determining  the  above  frequencies  the  values  of  the  moments  about 
the  centroid  are  first  computed;  since  there  is  high  contact  at  the  ends 
of  the  range  Sheppard's  corrections  are  applied.  The  resulting  moment 
coefficients12  are: 

Mo  =  1  ;  #i  =  0  !        =  4.466288  ;  jf,  3  -1.73056;  tf4  =  72.828685  ; 

tf5  =  -140.354144  ;N6    =  2585.761355. 
These  values  are  substituted  in  the  formulae  developed  in  part  II  of  the 
present  paper. 

Karl  Pearson  in  the  Philosophical  Magazine,  series  5,  volume  50, 
cage  157,  under  the  head  of  "On  the  Criterion  that  a  given  System  of  De- 
viations from  the  Probable  in  the  Case  of  a  Correlated  System  of  Varia- 
bles is  such  that  it  can  be  reasonably  supposed  to  have  arisen  from  Ran- 
dom Sampling"  gives  the  criterion  of  fit  which  is  used  to  test  these  dif- 
ferent distributions.  Hllderton's  Tables,  Biometrika,  volume  I,  page  155, 
are  used  in  making  the  numerical  calculations.  We  give  here  the  method 
of  calculation  of  P,  the  probability  that  the  deviation  from  the  probable 
of  the  theoretical  curve  can  reasonably  be  suoposed  to  have  arisen  from 
random  sampling,  in  the  case  of  Pearson's  curve  and  also  in  the  case  of 
the  normal  curve.  Here,  as  in  each  comDutation  of  this  character,  the 
variate  at  60  is  omitted  from  consideration.  By  Chauvenet's  criterion 
this  v§riate_dQes_not  rightfully  belong  to  the  distribution  for  nurooses 

12 

Bp  moment  coefficient  is  meant  the  moment  divided  by  the  total  fre- 
quency. 
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of  computation. 


X, 

r  r 

LSesSiJ 
■ 

62 

3 

1 

-2 

4 

4.  r 

63 

1 

2 

1 

1 

.5 

64 

2 

5 

3 

9 

1.3 

65 

10 

14 

4 

16 

1.143 

66 

37 

39 

2 

4 

.103 

67 

91 

77 

-14 

196 

2*546 

68 

173 

160 

-13 

169 

1.056 

69 

261 

287 

26 

676 

2. 355 

70 

408 

424 

16 

256 

.604 

71 

511 

439 

22 

434 

.99 

72 

457 

427 

SO 

900 

2. 108 

73 

226 

231 

53 

2809 

9.996 

74 

152 

143 

-9 

31 

—  ^  /-> 
•  ooo 

75 

64 

59 

-5 

25 

.424 

76 

31 

21 

-10 

100 

4.762 

77 

6 

7 

1 

1 

.143 

78 

2 

2 

0 

0 

0. 

33.096 

X'*  =  33.096  ,  n 1  =  13,  P  =  .013539. 
For  notation  of  this  and  the  following  table  see  page  8  and  Philo- 
sophical Magazine,  ser.  5,  vol.  50,  p.  157. 


>2 

x.  ruii.     r (*r-»;).  (*r-<)'2.  lwr»tL* 

82  3               0            -3  9 

63  110  0 

64  2  2  0  0 
85  10              10             0  0 

66  37  33            -4  16  .485 

67  91  87            -4  16  .184 
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f (*) «. 

f  (x)  ■  . 

(flZ  ~w  f )  • 
p  r 

r  r 

68 

173 

183 

10 

100 

.546 

69 

261 

237 

26 

676 

2.355 

70 

408 

442 

34 

1156 

2.615 

71 

511 

455 

-56 

3136 

6.892 

72 

457 

397 

-60 

3600 

9.06 

73 

228 

278 

50 

2500 

8.993 

74 

152 

157 

5 

25 

.159 

75 

64 

70 

6 

36 

.514 

76 

31 

26 

-5 

25 

.962 

77 

6 

3 

2 

4 

.5 

76 

2 

2 

0 

0 

.0 

It  is  to  be  noted  that  in  the  case  of  the  Pearson's  curve,  the 
largeness  of  the  value  of  x*  is  due  to  the  difference  between  the  observ- 
ed and  theoretical  values  at  73.  The  observed  distribution  is  irregular 
at  this  point.  If,  now,  this  value  should  be  considered  as  eaual  to  the 
largest  of  the  remaining  factors  determining  x'2*  then  x'e  has  the  value 
28.021  and  p-0, 04472,  which  means  a  fair  fit,  for  under  random  sampling 
we  may  expect  such  a  set  of  deviations  four  times  in  a  hundred. 

When  the  normal  curve  is  used,  the  tails  lead  to  indefinitely  large 
values  of  x'2»  For  tnis  reason  we  have  grouped  together  the  frequencies 
at  the  tails  until  groups  containing  at  least  10  variates  are  obtained. 
Using  this  method  we  find  x2=24.744  for  the  Pearson's  curve  and  x'£=33.285 
for  the  normal  curve.  Since  n1  is  now  IS,  we  have  for  the  Pearson's  curve 
p=0. 016235  and  for  the  normal  curve  p=0. 001898. 

Using  this  same  method,  we  obtain  for  the  distribution  determined 
by  using  three  terms  of  the  series,  P=0. 000087;  and  for  the  distribution 
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obtained  by  the  use  of  five  terms,  P=0. 000000. 

It  is  interesting  to  note  here  that  while,  in  general,  distributions 
of  stature  may  be  well  fitted  with  the  normal  curve,  ia  this  particular 
instfance  -  the  distribution  being  that  of  men  of  specified  age  and  weight- 
the  approximation  obtained  with  the  normal  curve  is  certainly  not  good, 
while  a  closer  approximation  is  obtained  with  the  Pearson's  Type  IV  curve, 

Bruns  maintains  that  for  many  purposes  sum-functions  are  preferable 
to  the  "verteilungs"*3  functions.  A  sum-function  has,  of  course,  this  ad- 
vantage that  it  is  a  monotone  increasing  function.  The  sum-functions  as 
given  by  one,  three,  or  five  terms  of  the  series  are  given  here14; 


• 

60 

1 

*■ 

o1* 

o 

0 

61 

1 

0 

0 

1 

62 

4 

1 

o 

63 

5 

1 

3 

7 

64 

7 

o 

11 

16 

65 

17 

13 

29 

27 

66 

54 

46 

66 

47 

67 

145 

134 

141 

111 

63 

318 

317 

340 

288 

69 

579 

603 

554 

584 

70 

987 

1044 

1002 

1051 

71 

1498 

1500 

1495 

1501 

72 

1955 

1897 

1924 

1891 

73 

2183 

2175 

2205 

2177 

74 

2335 

2332 

2346 

2342 

a  ■ 

verteilungs*' 

function  is 

meant  a 

frequency 

explanation  of 

table,  see 

page  S. 

The  zeros  used  here  mean  ttvat  the  theoretical  values  are  less  than 
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X  • 

T\ xi 9» 

T  \ x  J 

f  I  y\ 
T \X  1  4« 

75 

2399 

2403 

2404 

2412 

76 

2430 

2428 

2425 

2432 

77 

2436 

2436 

2434 

2436 

78 

2438 

2438 

2437 

2437. 

The  results  of  the  application  of  the  criterion  of  fit  to  these 
distributions  are  not  comparable  to  the  results  obtained  upon  its  appli- 
cation to  the  "verteilung",  since  in  this  case  the  value  of  Xz  *s  affect- 
ed not  only  by  the  frequency  for  each  particular  value  of  x  but  also  by 
all  the  values  that  have  preceded  it.  The  application  of  the  criterion, 
however,  makes  possible  a  comparison  of  the  different  approximations  and 
the  observed  distribution.  For  the  curves  resulting  when  one,  three,  or 
five  terms  of  the  series  are  used,  the  probabilities  that  the  deviations 
of  the  theoretical  curves  from  the  probable  can  reasonably  be  supposed 
to  have  arisen  from  random  sampling  are:  0.013043,  0.163433,  and  0.015921 
For  the  curve  using  terms  to  «p(u)4,  if  the  variates  at  62  be  omitted 
from  consideration,  the  value  of  y(*  is  13.6243  and  the  value  of  P, 
0.694736,  which  indicates  an  excellent  fit.  For  the  case  of  the  normal 
curve  and  the  five-term  approximation,  the  larger  values  for  the  determi- 
nation of  x'£  are  not  at  the  tails. 

The  Bruns  series  is  convergent  and  we  reasonably  expect  to  find  a 
closer  approximation  to  the  original  data  by  using  five  terms  of  the 
series  than  by  using  three.  The  poorer  fit  may  be  explained,  however,  by 
considering  the  very  high  probable  errors  of  higher  moments.  For  the 
problem  in  question,  the  error  in  the  sixth  moment  was  found  to  be  13.65 
per  centum  of  this  moment.  This  error  was  determined  with  the  variate  at 
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60  omitted.  It  would  undoubtedly  have  been  much  larger  had  that  variate 
been  considered. 

Another  problem  has  also  been  taken  up  for  consideration.  The  data 
is  from  the  Medico-Actuarial  Mortality  Investigation,  volume  I,  page  71, 
and  consists  of  the  weights  of  5646  women  five  feet  and  four  inches  tall. 
The  table  gives  the  observed  frequencies  and  the  theoretical  frequencies 
where  one,  three,  or  five  terms  of  the  series  are  used1*. 


75 

0 

1 

0 

0 

80 

0 

4 

0 

0 

85 

1 

12 

0 

0 

90 

1 

29 

4 

0 

95 

10 

65 

24 

2 

100 

84 

130 

33 

100 

105 

228 

231 

206 

211 

110 

462 

368 

400 

482 

115 

542 

522 

830 

636 

120 

863 

861 

813 

765 

125 

897 

747 

835 

799 

130 

744 

753 

797 

740 

135 

551 

878 

610 

701 

140 

422 

543 

400 

387 

145 

259 

389 

244 

312 

150 

232 

248 

159 

137 

155 

112 

141 

120 

108 

160 

100 

72 

97 

68 

185 

58 

33 

73 

50 

170 

42 

13 

48 

39 

175 

.  15 

4 

27 

27 

**  For  explanation  of  tables  Bee  page  3. 
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X0 

f(x)Q. 

f(x)A. 

**• 

180 

12 

2 

13 

17 

185 

4 

0 

5 

9 

190 

5 

0 

2 

4 

195 

1 

o 

1 

1 

200 

3 

0 

0 

1. 

Having 

given  these 

results,  one  can  easily 

deduce  by  numerical  cal- 

culation  that 

in  this  case 

also  the 

approximation  obtained  by  the  use  of 

three  terms 

of 

the  Bruns 

series  is 

better  than 

that  obtained  by  the  use 

of  five.  It 

is 

also  better 

than  that  obtained  by  the  use  of  the  normal 

curve. 

The  following  table 

gives  the 

sum-functions  for  this  problem17. 

X. 

f  (*);*. 

f(x\*. 

f(x)A. 

75 

0 

1 

0 

0 

80 

0 

2 

0 

0 

85 

1 

6 

0 

0 

90 

2 

17 

4 

0 

95 

12 

46 

28 

2 

100 

96 

240 

111 

102 

105 

324 

471 

317 

313 

110 

786 

839 

717 

795 

115 

1328 

1361 

1347 

1431 

120 

2191 

2022 

2165 

2197 

125 

3088 

2769 

3050 

2996 

130 

3832 

3523 

3848 

3736 

135 

4383 

4200 

4458 

4437 

140 

4805 

4743 

4858 

4824 

145 

'V  — 

5064 

5132 

5102 

5136 

17  For  explanation  of 

tables  see 

page  S. 
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X* 

150 

5296 

5380 

5260 

5323 

155 

5408 

5521 

5380 

5431 

160 

5508 

5593 

5477 

5499 

165 

5564 

5626 

5550 

5549 

170 

5606 

5639 

5598 

5588 

175 

5621 

5643 

5624 

5615 

180 

5633 

5645 

5638 

5632 

185 

5637 

5646 

5643 

5640 

190 

5642 

5646 

5645 

5644 

195 

5643 

5646 

5646 

5645 

200 

5646 

5646 

5646 

5646. 

Here  again  the  approximation  obtained  by  the  use  of  three  terms  is 
better  than  that  obtained  by  the  nofcmal  function;  while  it  is  also  better 
than  that  obtained  when  five  terms  of  the  series  are  used,  it  should  be 
remarked  that  the  difference  is  due  to  extreme  variates  and,  if  these  be 
eliminated  from  consideration,  the  goodness  of  fit  is  very  nearly  the 
same* 

IV.  CONCLUSIONS. 

The  result  of  the  application  of  the  Bruns  series  in  Problem  I  is 
less  probable  than  that  obtained  with  the  Pearson's  Type  IV  curve.  There 
is  this  point  to  be  considered,  however:  the  numerical  calculations  in- 
volved in  the  application  of  a  Pearson's  curve  are  laborious,  especially 
when  Type  IV  is  required  and  ordinates  must  be  calculated.  The  use  of  the 
tables  that  Bruns  has  calculated  render  the  application  of  the  Bruns 
series  comparatively  easy. 

It  would  be  very  natural  to  infer  from  statements  about  the  applica- 
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bility  of  the  Bruns  method"  to  arbitrary  distributions  that  the  succes- 
sive terms  of  this  expansion  in  series  gradually  approach  our  observed 
distributions.  The  results  of  the  present  paper  show  the  unexpected  fact 
that  three  terms  of  the  series  in  each  of  the  cases  here  considered  give 
a  better  approximation  than  five  terms.  The  same  situation  in  this  respec 
obtains  for  both  the  applications  that  have  been  here  made. 

Bruns, Wahrscheinlichkeitsrechnung  und  Kollektivmasslehre,   p.  111,. 


